KAAS: an automatic genome annotation and pathway reconstruction server
Abstract
The number of complete and draft genomes has grown rapidly in recent years, and it has become increasingly important to automate the identification of functional properties and biological roles of genes in these genomes. In the KEGG database, genes in complete genomes are annotated with KEGG Orthology (KO) identifiers, or K numbers, based on best-hit information from Smith-Waterman scores as well as manual curation. Each K number represents an ortholog group of genes and is directly linked to an object in a KEGG pathway map or the BRITE functional hierarchy. Here, we have developed a web-based server called KAAS (KEGG Automatic Annotation Server: http://www.genome.jp/kegg/kaas/), an implementation of a rapid method to automatically assign K numbers to genes in a genome, enabling reconstruction of KEGG pathways and BRITE hierarchies. The method is based on sequence similarities, bi-directional best hit information and some heuristics, and has achieved a high degree of accuracy when compared with the manually curated KEGG GENES database.
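The bi-directional best hit idea mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the KAAS implementation: the abstract does not give the scoring scheme, thresholds or heuristics, so the function names (`best_hits`, `assign_k_numbers`), the gene and reference identifiers, the scores and the K-number labels below are all hypothetical placeholders. The sketch only shows the general pattern of keeping a query gene's top-scoring reference gene when that reference gene's top-scoring hit points back to the same query gene, and then copying the reference gene's K number.

```python
from typing import Dict, List, Tuple

# A hit is (query_id, target_id, similarity_score). All values are made up.
Hit = Tuple[str, str, float]


def best_hits(hits: List[Hit]) -> Dict[str, Tuple[str, float]]:
    """Return the highest-scoring target (and its score) for each query."""
    best: Dict[str, Tuple[str, float]] = {}
    for query, target, score in hits:
        if query not in best or score > best[query][1]:
            best[query] = (target, score)
    return best


def assign_k_numbers(
    forward: List[Hit],
    reverse: List[Hit],
    ko_of: Dict[str, str],
) -> Dict[str, str]:
    """Assign a K number to each query gene that has a bi-directional best hit.

    forward: hits from query genes against reference genes.
    reverse: hits from reference genes against query genes.
    ko_of:   hypothetical lookup from reference gene to its K number.
    """
    fwd = best_hits(forward)
    rev = best_hits(reverse)
    assignments: Dict[str, str] = {}
    for gene, (ref, _score) in fwd.items():
        # Keep the pair only if the reference gene's best hit is this gene.
        reciprocal = rev.get(ref)
        if reciprocal is not None and reciprocal[0] == gene and ref in ko_of:
            assignments[gene] = ko_of[ref]
    return assignments


# Toy data: g1 and ref:A are each other's best hit; g2's best hit ref:B
# prefers g1, so g2 receives no assignment.
forward_hits = [("g1", "ref:A", 250.0), ("g1", "ref:B", 90.0), ("g2", "ref:B", 120.0)]
reverse_hits = [("ref:A", "g1", 248.0), ("ref:B", "g1", 95.0), ("ref:B", "g2", 80.0)]
ko_lookup = {"ref:A": "K_EXAMPLE_1", "ref:B": "K_EXAMPLE_2"}

result = assign_k_numbers(forward_hits, reverse_hits, ko_lookup)
print(result)
```

A production method would presumably add score thresholds and further heuristics on top of this reciprocal check, as the abstract indicates, but those details are not stated here.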
Related resources
Biennial Report on Carcinogens Listing/Delisting Procedure
The number of complete and draft genomes is rapidly growing in recent years, and it has become increasingly important to automate the identification of functional properties and biological roles of genes in these genomes. In the KEGG database, genes in complete genomes are annotated with the KEGG orthology (KO) identifiers, or the K numbers, based on the best hit information using Smith– Waterm...
From Function Prediction to Pathway Prediction: A New Pipeline Based on KAAS and GENIES
The number of complete and draft genomes has increased in recent years. The prediction of precise biological roles of the genes of such sequenced organisms is becoming an important issue in computational biology. We have recently developed two novel systems: KAAS (KEGG Automatic Annotation Server) [2, 4] and GENIES (Gene Network Inference Engine based on Supervised Analysis) [3] as computationa...
KAAS: KEGG Automatic Annotation Server
The number of complete and draft genomes has rapidly increased in recent years, and it has become increasingly important to identify the functional properties and biological roles of genes in these genomes. We have been developing KEGG Orthology (KO) to classify gene functions. In KO, we annotate genes in complete genomes based on best-hit information using Smith-Waterman scores, as well as by ...
A Parsimony Approach to Biological Pathway Reconstruction/Inference for Genomes and Metagenomes
A common biological pathway reconstruction approach -- as implemented by many automatic biological pathway services (such as the KAAS and RAST servers) and the functional annotation of metagenomic sequences -- starts with the identification of protein functions or families (e.g., KO families for the KEGG database and the FIG families for the SEED database) in the query sequences, followed by a ...
CycADS: an annotation database system to ease the development and update of BioCyc databases
In recent years, genomes from an increasing number of organisms have been sequenced, but their annotation remains a time-consuming process. The BioCyc databases offer a framework for the integrated analysis of metabolic networks. The Pathway tool software suite allows the automated construction of a database starting from an annotated genome, but it requires prior integration of all annotations...